Increasing Weak Classifiers Diversity by Omics Networks
نویسنده
چکیده
The common problems in machine learning from omics data are the scarcity of samples, the high number of features and their complex interaction structure. The models built solely from measured data often suffer from overfitting. One of possible methods dealing with overfitting is to use prior knowledge for regularization. This work analyzes contribution of feature interaction networks in regularization of ensemble classifiers representing another approach to overfitting reduction. We study how utilization of feature interaction networks influences diversity of weak classifiers and thus accuracy of the resulting ensemble model. The network and its random walks are used to control the feature randomization during construction of weak classifiers, which makes them more diverse than in the well-known random forest. We experiment with different types of weak classifiers (trees, logistic regression, näıve Bayes) and different random walk lengths and demonstrate that diversity of weak classifiers grows with increasing network locality of weak classifiers.
منابع مشابه
Classifier Ensemble Framework: a Diversity Based Approach
Pattern recognition systems are widely used in a host of different fields. Due to some reasons such as lack of knowledge about a method based on which the best classifier is detected for any arbitrary problem, and thanks to significant improvement in accuracy, researchers turn to ensemble methods in almost every task of pattern recognition. Classification as a major task in pattern recognition,...
متن کاملEnsemble Selection Using Diversity Networks
An ideal ensemble is composed of base classifiers that perform well and that have minimal overlap in their errors. Eliminating classifiers from an ensemble based on a criterion that reflects poor classification performance and error redundancy with peer classifiers can improve ensemble performance. The Diversity Networks method asymmetrically evaluates each pair of classifiers as a linear combi...
متن کاملA Novel Ensemble Approach for Anomaly Detection in Wireless Sensor Networks Using Time-overlapped Sliding Windows
One of the most important issues concerning the sensor data in the Wireless Sensor Networks (WSNs) is the unexpected data which are acquired from the sensors. Today, there are numerous approaches for detecting anomalies in the WSNs, most of which are based on machine learning methods. In this research, we present a heuristic method based on the concept of “ensemble of classifiers” of data minin...
متن کاملGreedy optimization classifiers ensemble based on diversity
Decreasing the individual error and increasing the diversity among classifiers are two crucial factors for improving ensemble performances. Nevertheless, the ‘‘kappa-error’’ diagram shows that enhancing the diversity is at the expense of reducing individual accuracy. Hence, a newmethod namedMatching Pursuit Optimization Ensemble Classifiers (MPOEC) is proposed in this paper in order to balance ...
متن کاملExamining the Relationship Between Majority Vote Ac - curacy and Diversity in Bagging and
Much current research is undertaken into combining classifiers to increase the classification accuracy. We show, by means of an enumerative example, how combining classifiers can lead to much greater or lesser accuracy than each individual classifier. Measures of diversity among the classifiers taken from the literature are shown to only exhibit a weak relationship with majority vote accuracy. ...
متن کامل